AKF-SR: Adaptive Kalman filtering-based successor representation
نویسندگان
چکیده
To understand animals’ behavior in finding relations between similar tasks and adapting themselves to changes the tasks, it is necessary know how brain generalizes learned knowledge from a previous task unseen tasks. Recent studies neuroscience suggest that Successor Representation (SR)-based models provide adaptation goal locations or reward function faster than model-free algorithms, together with lower computational cost compared of model-based algorithms. However, not known such representation might help animals manage uncertainty their decision making. Existing methods for SR learning based on standard temporal difference (e.g., deep neural network-based algorithms) do capture about estimated SR. In order address this issue, paper presents Kalman filter-based framework, referred as Adaptive Filtering-based (AKF–SR). First, approach, which combination filter method, used within AKF–SR framework cast procedure into filtering problem benefit estimation SR, also decreases memory requirement sensitivity model’s parameters comparison An adaptive approach then applied proposed tune measurement noise covariance mapping most important affecting filter’s performance. Moreover, an active method exploits form behaviour policy leading more visits less certain values improve overall performance agent terms received rewards while interacting its environment. Experimental results three reinforcement environments illustrate efficacy over state-of-the-art frameworks cumulative reward, reliability, time cost, speed convergence function.
منابع مشابه
Adaptive Kalman Filtering
The increased power of small computers makes the use of parameter estimation methods attractive. Such methods have a number of uses in analytical chemistry. When valid models are available, many methods work well, but when models used in the estimation are in error, most methods fail. Methods based on the Kalman filter, a linear recursive estimator, may be modified to perform parameter estimati...
متن کاملAdaptive Observer and Kalman Filtering
In this paper the problem of the speed estimation of an Unmanned Aerial Vehicle is addressed, when only the standard outputs (acceleration, angles and angular speeds) are available for measurement. We focus our analysis on a prototype drone a 4 rotors helicopter robotwhich is not equipped with GPS related devices and relies on the Inertial Measurement Unit (IMU) only. Two different approaches h...
متن کاملSingle Channel Adaptive Kalman Filtering – Based Speech Enhancement Algorithm
This paper deals with the problem of speech enhancement when a corrupted speech signal with an additive Gaussian white noise is the only information available for processing. Speech enhancement aims to improve speech quality by using various algorithms. The objective of enhancement is improvement in intelligibility and/or overall perceptual quality of degraded speech signal using audio signal p...
متن کاملBlind adaptive multiuser detection based on Kalman filtering
Although several Kalman filtering algorithms have been presented for adaptive multiuser detection, none is “blind” due to requiring training data sequences and/or more knowledge than the spreading waveform and delay of the desired user. This paper proposes a novel blind adaptive multiuser detector based on Kalman filtering and compares it with previously published LMS and RLS algorithms for bli...
متن کاملAdaptive Kalman Filtering through Fuzzy Logic
− In this paper a development of an adaptive Kalman filter through a fuzzy inference system (FIS) is outlined. The adaptation is concerned with the imposition of conditions under which the filter measurement noise covariance matrix R or the process noise covariance matrix Q are estimated. The adaptive adjustment is carried out using a FIS based on the whiteness of the filter innovation sequence...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Neurocomputing
سال: 2022
ISSN: ['0925-2312', '1872-8286']
DOI: https://doi.org/10.1016/j.neucom.2021.10.008